In recent years, reinforcement learning has become one of the key research areas in artificial intelligence and machine learning, and it has attracted many researchers from other fields, including operations research, control theory, and robotics. Reinforcement learning differs from supervised learning in that it requires no teacher signals, that is, no expected outputs for each state. Instead, a reinforcement learning system learns by interacting with the environment, with the goal of maximizing (or minimizing) the evaluative feedback it receives from the environment.
Analysis of transcript data from a 90-minute class given by five excellent teachers yields the following features: 1) talking time is not fully dominated by the teachers, so students also have a share of it; 2) referential questions are generally more common than display questions; 3) most exchange structures are complex, although the IRF structure still accounts for a certain proportion; 4) discoursal feedback is slightly more frequent than evaluative feedback.